Speech Intelligibility Enhancement Using Convolutive Non-negative Matrix Factorization with Noise Prior
نویسندگان
چکیده
We propose a convolutive non-negative matrix factorization method to improve the intelligibility of speech signal in the context of adverse noise environment. The noise bases are prior learned with Non-negative Matrix Factorization (NMF) algorithm. A modified convolutive NMF with sparse constraint is then derived to extract speech bases from noisy speech. The divergence function is selected as an objective function to get a multiplicative update of speech base and its corresponding weight. The weights of prior learned noise bases are also updated in the update rule. Listening experiments are conducted to assess the intelligibility performance of speech synthesized using the proposed algorithm. Experimental results indicate that the proposed method is very effective to improve the intelligibility of the noisy speech in various noise contexts and it outperforms conventional algorithms.
منابع مشابه
Speech enhancement using convolutive nonnegative matrix factorization with cosparsity regularization
A novel method for speech enhancement based on Convolutive Non-negative Matrix Factorization (CNMF) is presented in this paper. The sparsity of activation matrix for speech components has already been utilized in NMF-based enhancement methods. However such methods do not usually take into account prior knowledge about occurrence relations between different speech components. By introducing the ...
متن کاملSpeech Enhancement Using Sparse Convolutive Non-negative Matrix Factorization with Basis Adaptation
We introduce a framework for speech enhancement based on convolutive non-negative matrix factorization that leverages available speech data to enhance arbitrary noisy utterances with no a priori knowledge of the speakers or noise types present. Previous approaches have shown the utility of a sparse reconstruction of the speech-only components of an observed noisy utterance. We demonstrate that ...
متن کاملRobust speech recognition in multi-source noise environments using convolutive non-negative matrix factorization
Convolutive non-negative matrix factorization (CNMF) is an effective approach for supervised audio source separation. It relies on the availability of sufficient training data to learn a set of bases for each acoustic source. For automatic speech recognition (ASR) in a multi-source noise environment, the varied nature of background noise makes it a challenging task to learn the noise bases and ...
متن کاملOn-Line NMF-Based Stereo Up-Mixing of Speech Improves Perceived Reduction of Non-Stationary Noise
Speech de-noising algorithms often suffer from introduction of artifacts, either by removal of parts of the speech signal, or imperfect noise reduction causing the remaining noise to sound unnatural and disturbing. This contribution proposes to spatially distribute monaural noisy speech signals based on single-channel source separation, in order to improve the perceived speech quality. Stereo u...
متن کاملStudy of Multiple Dictionaries in Exemplar-based Nmf for Speech Enhancement
Growing in importance, especially over the last years, speech enhancement has been an important research topic due to the fact that it is required in many applications in the daily life. Speech enhancement and noise reduction aim to improve the speech quality, intelligibility and overall perceptual clarity of a noisy signal by removing the unwanted noise using several techniques. The traditiona...
متن کامل